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Abstract — Adaptive inverse models find applications in 
communication and magnetic channel equalization, recovery 
of digital data and adaptive linearization of sensor 
characteristics. In presence of outliers in the training signal, 
the model accuracy is severely reduced. In this paper three 
robust inverse models are developed by recursively minimizing 
robust norms using BFO based learning rule. The performance 
of these models is assesses through simulation study and is 
compared with those obtained by standard squared norm based 
models. It is in general, observed that the Wilcoxon norm 
based model provides best performance. Moreover the squared 
error based model is observed to perform the worst. 

Index Terms — Adaptive inverse models, Robust norms, Robust 
adaptive inverse model, Bacterial foraging optimization 

I. Introduction 

In digital data communication systems, high speed data 
transmission over band limited channel often causes inter- 
symbol interference (ISI) due to adverse effect of dispersive 
channel [1]. The performance of linear channel equalizers is 
poor especially when the nonlinear distortion is severe [2]. 
In these cases, nonlinear equalizers are preferable [l]-[6]. 
Since artificial neural network (ANN) can perform complex 
mapping between its input and output space, different ANNs 
have been successfully used in nonlinear inverse modeling 
problem [l]-[6]. Functional link artificial neural network 
(FLANN) possess a low complexity structure and has been 
employed as an adaptive inverse model [7]-[8]. Recently, 
genetic algorithm (GA) has been used for nonlinear and blind 
channel equalization [9]-[10]. The operations in GA such as 
the crossover and the mutation, help to avoid local minima 
problem and thus provide improved solution. However there 
are some situations when the weights in GA are trapped to 
local minima. The bacterial foraging optimization (BFO) [1 1] 
like GA is a derivative free optimization technique and acts as 
an useful alternative to GA. The number of parameters that 
are used for searching the total solution space is higher in 
BFO compared to those in GA but on the other hand requires 
less number of computations. 



The possibility of BFO being trapped to local minimum is less. 
In recent years the BFO has been proposed and has been 
applied to many areas such as harmonic estimation of power 
system signals [12] and adaptive inverse modeling [13]. In 
case of derivative free algorithms conventionally the mean 
square error (MSE) is used as the fitness or cost function. 
Use of MSE as cost function leads to improper training of 
the parameters of adaptive model when outliers are present 
in the desired signal. The traditional regressors employ least 
square fit which minimizes the Euclidean norm of the error, 
while the robust estimator is based on a fit which minimizes 
another rank based norm called Wilcoxon norm [14]. It is 
reported that linear regressors developed using Wilcoxon 
norm are robust against outliers. Using this new norm robust 
machines have recently been proposed for approximation of 
nonlinear functions [15]. Inverse modeling finds many 
applications such as channel equalization in 
telecommunication [8], adaptive linearization in intelligent 
sensors [16] and digital control [17]. In all these applications 
the learning tools used are derivative based such as least 
mean square (LMS) or back propagation (BP) algorithms. 
When outliers are present in the training signals the inverse 
modeling performance of these algorithms degrades 
substantially. To the best of our belief no literature has dealt 
on the problem of development of robust inverse model in 
presence of outliers. Therefore our motivation in this paper 
is to address this problem and to provide effective solution 
using BFO based training of the models using a robust norms 
of error [15, 18, 19] as cost function. The choice of BFO as a 
training tool is due to its advantages over other evolutionary 
computing tool stated earlier. The paper is organized into 
following sections : Section II discusses inverse modeling 
problem. Robust adaptive inverse model is given in Section 
III. In Section IV the BFO based update algorithm is developed 
for the inverse model. Development of robust inverse model 
using BFO based training and robust norm minimization is 
dealt in Section V. For performance evaluation, the simulation 
study is carried out and is dealt in Section VI. Finally 
conclusion of the paper is outlined in Section VII. 
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U. ADAPTIVE INVERSE MODEL 
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Fig.l. Development of a robust inverse model using BFO based 
training cost function 

The input symbols are represented as x(k) at time 

instance, fc . They are then passed to a channel (plant) which 
may be linear or nonlinear. An FIR filter is used to model a 
linear channel whose output at time instant fc maybe written 

as 



N -1 



y(k) = ^w(i)x(k-i) 



(l) 



!=0 



where w(i) are the weight values and A/ is the length of 

the FIR plant. The "NL" block represents the nonlinear 
distortion introduced in the channel and its output may be 
expressed as a nonlinear function of input and channel 
coefficients. 



z(k) = y/(x(k),x(k-l), ,x(k -N +1) 



(2) 



w(0), w(l), , w(N - 1)), 

where y/(.) is some nonlinear function generated by the "NL" 
block. The channel output z(k) is corrupted with additive 
white Gaussian noise q(k) of variance <j 2 . This corrupted 
received output is given by r(k) . The received signal 
r(k) is then passed into the adaptive inverse model to 

produce x(k) which represents the input symbols x(k) . 

From initial parameters, the weights are updated until the 
conventional cost function 

A k=\ 



(3) 



is minimized. The term A/ stands for input samples used for 

training and the error term e(k) = X d (k) — x(k) . The 

received data is given by rik) = z(k) + C](k) . 
Conventionally the minimization of this cost function is 
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performed iteratively by derivative based LMS algorithm. 

EI. ROBUST ADAPTIVE INVERSE MODEL 

Three robust cost functions (RCF) defined in literature [15, 
18, 19] are chosen in the development of robust adaptive 
inverse models. The RCFs are defined as 

(a) Robust Cost Function-1 (Wilcoxon Norm) [14,15] 
The Wilcoxon cost function is a pseudo-norm and is 

defined as CF l = ^a(i?(v,))v,. = ^a(/)v,. (4) 

/=i i=i 

where i?(v ; ) denotes the rank of v ■ among 
v l ,v 2 , , Vj, v (1) < v (2) < V (;) are the ordered 

values of v,,v 2 , ,V l , a(i) = <f)[i 1(1 + 1)] . In 

statistics, different types of score functions have been dealt 
but the commonly used one is given by 



0OO = V12(m-O.5). 

(b) Robust Cost Function-2 [18] 
It is defined as 

CF 3 = ct(1 - exp(-e 2 / 2a)) (5) 

where c is a parameter to be adjusted during training and 
g 2 is the mean square error 

(c) Robust Cost Function-3 (Mean Log Squared error)[19] 

The next cost function is defined as 

CF 4 =log(l + ^-) (6) 

where is mean square error. 

The weight-update mechanism of inverse model of Fig. 1 is 
carried out by minimizing the cost functions of the errors 
defined in (4), (5) and (6) using BFO algorithm. 

IV. DEVELOPMENT OF ROBUST NONLINEAR INVERSE MODELS USING 

BFO 

The updating of the weights of the BFO based inverse 
model is carried out using the training rule as outlined in the 
following steps: 
Step-1 Initialization of parameters 

(i) S b = No. of bacteria to be used for searching the total 

region 

(ii) A/ = Number of input samples 

(iii) p = Number of parameters to be optimized 

(iv) N = Swimming length after which tumbling of bacteria 

is undertaken in a chemotactic loop. 

(v) N =Number of iterations carried out in a chemotactic 



loop, N > . 
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(vi) = Maximum number of reproduction loop 

(vii) = Maximum number of elimination and dispersal loop. 

(viii) P , = Probability with which the elimination and 

dispersal takes place. 

(ix) The location of each bacterium p (1- p , 1- S h , 1) is 

specified by random numbers on [0,1]. 

(x) The value of run length unit, C(i) is assumed to be 

constant for all bacteria. 

Step-2 Generation of model input 

(i) Random binary input [ 1 ,- 1] is applied to the channel/plant, 
(ii) The output of the channel is contaminated with white 
Guassian noise of known strength to generate the input signal, 
(iii) The binary input is delayed by half of the order of the 

inverse model to obtain the signal, X d (k) . 

Step -3 Weight update algorithms 

In this step the bacterial population, chemotaxis, reproduction, 

elimination and dispersal are modeled. Initially j = k = I = 

(i) Elimination dispersal loop / — / + 1 

(ii) Reproduction loop k = k +\ 

(iii) Chemotaxis loop 7 = 7+1 

(a) For f=l,2, S b , the cost function J(i,j,k,l) for 

each i th bacterium is evaluated by the following way : 

(1) A/ number of random binary samples are generated , 
passed through the model and the output is computed. 
(2)The output is then compared with the corresponding 

training signal, X d (k) to calculate the error, e(k) . 
(3)The robust cost functions of the error terms are computed 

as . J(ij,kJ) 

(4)End of For Loop. 

(b)For> = U Athe tumbling/swimming decision is taken. 

Tumble : A random vectcA(i") ; with each element, 4JS *=U p, 

in the range of [-1, 1] is generated. The new position of i :h 
bacterium in the direction of tumble is computed as 

f 1 (j -i-ijt. o = J=" U *- ■ ODO v , — 

co 

The updated position is used to compute the new cost 

function (mean squared error) J(j. j — L jfc, /) - 

Swim - (i) Let c (counter for swim length) = 

(ii) Whilee < JV, , hat is bacteria have not climbed down too 

long then update c = e + 1 

If J~(jy < J{j— 1) then the new position of th bacterium is 

computed by using (7). The updated position, 

P(j + l,k,l) is used to compute the new cost function, 

J(i,j + l,k,l) 

ELSE c = N s . (End of the WHILE statement). 

(c)Go to next bacterium (;' + 1) if i^S b the process is 
repeated. 
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(d)If min( J ) { minimum value of J among all the bacteria } is 
less than the tolerance limit then all loops are broken. 
Step-4. If 7 < N c ,go to (iii) to continue chemotaxis loop since 
the lives of bacteria are not over. 

Step-5 Reproduction 

(a) For the given fc and /, and for each 

Z = 1,2, jS^the bacteria are sorted in 

ascending order of cost functions, j (higher cost function 
means lower health). 

(b) The first half of bacteria population is allowed to die and 
the remaining half bacteria with least cost functions split into 
two and the copies are placed at the same location as their 
parent. 

Step-6. If k < N go to Step-2. In this case, the number of 

specified reproduction steps has not reached and the next 
generation in the chemotactic loop is started. 

Step-7. Elimination -Dispersal 

The bacterium with an elimination-dispersal probability above 

a preset value P ed is eliminated by dispersing to a random 

location and new replacements are randomly made in the 
search space. By this approach the total population is 
maintained constant. 

V Development of robust inverse model using bfo based 

TRAINING 

Three robust cost functions defined in literature [15, 18, 
19] are used to develop the robust inverse models. The BFO 
is then used to iteratively minimize these cost functions of 
the error obtained from the model. The resulting inverse 
model is expected to be robust against outliers. The weight- 
update of inverse model of Fig. 1 is carried out by minimizing 
these cost functions of the errors defined in (4), (5) and (6) 
using BFO algorithm. 

In this approach, the procedure outlined in Steps- 1 to 7 of 
Section IV remains the same. The only exception is detailed 
as follows : 

Let the error vector of p th bacterium at k th generation due 
to application of TV input samples to the model be 

represented as [e lp (k),e 2p (k), ,e Np (k)f. 

The errors are then arranged in an increasing manner from 

which the rank R{e„ „(k)} of each nth error term is 

obtained. The score associated with each rank of the error 
term is evaluated as 



a(0 = Vl2(— 0.5) 

N + 1 



(8) 



where (1 < i < N) denotes the rank associated with each 
error term. At jc th generation of each p th particle the 



Wilcoxon norm is then calculated as 
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C p (k) 



X a (o 



«,,(*) 



(9) 



Similarly other two robust CFs are also computed using (5) 
and (6). The learning process continues until the CF 
decreases to the possible minimum values. At this stage the 
training process is terminated and the resulting weight vector 
represents the final weights of the inverse model 



VI. 



SIMULATION STUDY 



In this section, the simulation study of the proposed 
inverse model in presence of 10% to 50% of outliers in the 
training signal is carried out. Fig. 1 is simulated for various 
nonlinear channels using the algorithm given in Sections V 
and VI. The transfer functions of three standard linear systems 
used in the simulation study are : 

//, (z) : 0.209 + 0.995z~' + 0.209z 2 

H 2 (z) : 0.260 + 0.930z~' + 0.260z~ 2 (1Q) 

H i (z) : 0.304 + 0.903z "' + 0.304z~ 2 

The eigen-value-ratio (EVR) of the plants given in (10) are 
6.08, 11.12 and 21.71 respectively [8] which indicate the 
complexity of the plant or channel. A higher EVR indicates a 
bad channel in the sense that the convergence of the model 
becomes poor. To study the effect of nonlinearity on the 
inverse model performance, two different nonlinearities are 
introduced 



Nil : z(k) = tanh(y(/fc)) 

NI2 :z(k) = y(k) + 0.2/ (k) - 0. ly 3 (k) 



(11) 



where y(k) is the output of each of linear systems defined in 
(10). The additive noise in the channel is white Gaussian with 
-30dB strength. In this study an 8-tap adaptive FIR filter is 
used as an inverse model. The desired signal is generated by 
delaying the input binary sequence by half of the order of 
the inverse model. Outliers are added by simply replacing the 
bit value from 1 to - 1 or - 1 to 1 at randomly selected locations 
(10% to 50%) of the desired signal. In this simulation study, 

we have used the following parameters of BFO : S b =8, N h 

= 100, p=S, N s =3, N c =5, A^=40-60, A^=10, P, ( , = 0.25, 

C(i) = 0.0075. This selection of parameters is based on 

achieving best inverse modeling performance through 
simulation. 

The bit-error-ratio (BER) plot of BFO trained inverse model 
pertaining to different nonlinear plants with different cost 
functions in presence of 0%-50% of outliers are obtained 
through simulation. The BER plots with 50% outliers only 
are shown in the Figs.2(a)-(f). In these figures the BER plots 
of four cost functions have been compared. To study the 
effects of EVR of the plant on BER performance the SNR is 
set at 15dB in presence of 0% and 50% outliers in the training 
signal and the results are shown in Fig. 3 for 50% outliers. 
Few notable observations obtained from these plots are : 
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(a)Keeping the CF, SNR and percentage of outliers in the 

training signal same, the BER increases with increase in the 

EVR of the channel. Similarly under identical simulation 

conditions the squared error cost function based inverse 

model performs the worst where as the Wilcoxon norm based 

model provides the best performance (least BER). 

(b)As the outliers in the training signal increases the Wilcoxon 

norm based inverse model continues to yield lowest BER 

compared to those provided by other norms. 

(c)With no outlier present in the training signal, the BER plot 

of all four CFs are almost same 

(d)In presence of high outliers the conventional CF2 based 

model performs the worst followed by CF4 based model. In 

all cases the Wilcoxon norm (CF1) based inverse model 

performs the best and hence is more robust against low to 

high outliers in the training signal. 

(e)The accuracy of inverse model based on CF3 and CF4 

norms developed using outliers is almost identical. 

(f)In addition, the plots of Fig. 3 indicate that at 50% outliers 

in the training signal the BER increases with increase in the 

EVR of the nonlinear plants. 

(g) Further, the BER of the inverse models of all plants and 

SNR conditions is highest in square error norm (CF2) based 

training compared to three other robust norms used. However 

the Wlcoxon norm (CF1 ) based inverse model yields minimum 

BER among all cases studied. 
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Fig. 2(a) Comparison of BER of four different CFs based nonlinear 

equalizers with [.209, .995, .209] as channel coefficients and NL1 

with 50% outliers 



r F F F 1 


— + — C--I 
cr-3 










■ •- ~^&~~*"^I ■ ' ' 


--[""^|r*^"""": ~~; ~ 


! 1 1 1 1 ll 1 1 1 1 i^Mil lcTt*i|liJ 1 1 1 111 1 1 1 1 1 




■ aaaaaaaaaaaaaa a^a la aaaaaaaaaaaaaa 3a a 




i~ r i 1 "^, " ' 


r r W ■ ■ 






::::::^:::::^::: z z3 z ic::|::::::|:::::: 


i i i"T "X i i 


! ! ! \ v ! ! 



Fig. 2(b) Comparison of BER of four different CFs based nonlinear 

equalizers with [.209, .995, .209] as channel coefficients and NL2 

with 50% outliers 
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Fig. 2(c) Comparison of BER of four different CFs based nonlinear 

equalizers with [.260, .930, .260] as channel coefficients and NL1 

with 50% outliers 




Fig. 2(d) Comparison of BER of four different CFs based nonlinear 

equalizers with [.260, .930, .260] as channel coefficients and NL2 

with 50% outliers 
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Fig. 2(f) Comparison of BER of four different CFs based nonlinear 

equalizers with [.304, .903, .304] as channel coefficients and NL2 

with 50% outliers 
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Fig. 2(e) Comparison of BER of four different CFs based nonlinear 

equalizers with [.304, .903, .304] as channel coefficients and NL1 

with 50% outliers 
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equalizers in presence of 50% outliers 
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VII. CONCLUSION 

This paper examines and evaluates the learning capability 
of different robust norms of error when the training signal (of 
the inverse model) is contaminated with strong outliers. To 
facilitate such evaluation different nonlinear plants with 
varying EVRs are used. The population based BFO learning 
tool is developed to minimize four different norms. The 
robustness of these norms is assessed through extensive 
simulation study. It is in general observed that the 
conventional squared error norm (CF2) is least robust to 
develop inverse models of nonlinear systems under varying 
noise conditions whereas the Wilcoxon norm (CF1) is the 
most robust one. In terms of quality of performance, the norms 
are grouped in the order CF1 , CF3, CF4 and CF2. 
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